20 research outputs found

    An integrated FLEx–ELAN workflow for linguistic analysis with multiple transcriptions and translations and multiple participants

    Get PDF
    This paper presents a workflow integrating the linguistic software ELAN and FLEx. This workflow allows the user to move between these two software applications to refine the transcription, translation, and annotation of the speech of multiple participants. The workflow also enables the addition of multiple writing systems for vernacular and analysis languages. The paper is based on a manual that explains in a simple and visual manner how to achieve such a set-up in both ELAN and FLEx. The workflow allows language consultants to make changes and additions to transcriptions and translations in ELAN in a script and language that they are most comfortable with. In this way, the workflow fills a gap where language consultants with limited computer literacy and command of the major interface languages of software programmes can still work on the basic analysis of recordings of a language that they know well.National Foreign Language Resource Cente

    Sartang (West Kameng district, Arunachal Pradesh, India) – Language Contexts

    Get PDF
    Sartang is a recently coined name for a Scheduled Tribe inhabiting four villages and their associated hamlets in West Kameng district of the state of Arunachal Pradesh in India. Sartang also refers to the four linguistic varieties that the people belonging to this Scheduled Tribe speak. Sartang is a Trans-Himalayan language belonging to the Western Kho-Bwa languages of the Kho-Bwa cluster. Because of low speaker numbers and rapid socio-economic developments in the area, Sartang may be considered vulnerable. This paper provides an initial overview of the four Sartang varieties, their purported origin, their history, their genetic classification, their contact languages, the language use, and attitudes, and two characteristic aspects of the Sartang culture

    Languages of the World – Kusunda.

    Get PDF

    The multiple benefits of making predictions in linguistics

    Get PDF
    Through an experiment on a Western Kho-Bwa linguistic dataset, Timotheus A. Bodt and Johann-Mattis List provide evidence for the regularity of sound change

    Computer-Assisted Language Comparison: State of the ArtW

    Get PDF
    Historical language comparison opens windows onto a human past, long before the availability of written records. Since traditional language comparison within the framework of the comparative method is largely based on manual data comparison, requiring the meticulous sifting through dictionaries, word lists, and grammars, the framework is difficult to apply, especially in times where more and more data have become available in digital form. Unfortunately, it is not possible to simply automate the process of historical language comparison, not only because computational solutions lag behind human judgments in historical linguistics, but also because they lack the flexibility that would allow them to integrate various types of information from various kinds of sources. A more promising approach is to integrate computational and classical approaches within a computer-assisted framework, “neither completely computer-driven nor ignorant of the assistance computers afford” [1, p. 4]. In this paper, we will illustrate what we consider the current state of the art of computer-assisted language comparison by presenting a workflow that starts with raw data and leads up to a stage where sound correspondence patterns across multiple languages have been identified and can be readily presented, inspected, and discussed. We illustrate this workflow with the help of a newly prepared dataset on Hmong-Mien languages. Our illustration is accompanied by Python code and instructions on how to use additional web-based tools we developed so that users can apply our workflow for their own purposes

    Computer-Assisted Language Comparison: State of the Art

    Get PDF
    Historical language comparison opens windows onto a human past, long before the availability of written records. Since traditional language comparison within the framework of the comparative method is largely based on manual data comparison, requiring the meticulous sifting through dictionaries, word lists, and grammars, the framework is difficult to apply, especially in times where more and more data have become available in digital form. Unfortunately, it is not possible to simply automate the process of historical language comparison, not only because computational solutions lag behind human judgments in historical linguistics, but also because they lack the flexibility that would allow them to integrate various types of information from various kinds of sources. A more promising approach is to integrate computational and classical approaches within a computer-assisted framework, “neither completely computer-driven nor ignorant of the assistance computers afford” [1, p. 4]. In this paper, we will illustrate what we consider the current state of the art of computer-assisted language comparison by presenting a workflow that starts with raw data and leads up to a stage where sound correspondence patterns across multiple languages have been identified and can be readily presented, inspected, and discussed. We illustrate this workflow with the help of a newly prepared dataset on Hmong-Mien languages. Our illustration is accompanied by Python code and instructions on how to use additional web-based tools we developed so that users can apply our workflow for their own purposes

    The Database of Cross-Linguistic Colexifications, reproducible analysis of cross-linguistic polysemies

    Get PDF
    Advances in computer-assisted linguistic research have been greatly infuential in reshaping linguistic research. With the increasing availability of interconnected datasets created and curated by researchers, more and more interwoven questions can now be investigated. Such advances, however, are bringing high requirements in terms of rigorousness for preparing and curating datasets. Here we present CLICS, a Database of Cross-Linguistic Colexifcations (CLICS). CLICS tackles interconnected interdisciplinary research questions about the colexifcation of words across semantic categories in the world's languages, and show-cases best practices for preparing data for cross-linguistic research. This is done by addressing shortcomings of an earlier version of the database, CLICS2, and by supplying an updated version with CLICS3, which massively increases the size and scope of the project. We provide tools and guidelines for this purpose and discuss insights resulting from organizing student tasks for database updatesTT, MSW, NES, YL, and JML were funded by the the ERC Starting Grant 715618 Computer-Assisted Language Comparison (http://calc.digling.org). SJG was supported by the Australian Research Council’s Discovery Projects funding scheme (project number DE 120101954) and the ARC Center of Excellence for the Dynamics of Language grant (CE140100041). MKT was supported by the Riksbankens Jubileums Fond (Grant SAB17-0588:1). TB was funded by the Swiss National Science Foundation, P2BEP1_181779, “Reconstruction of Proto-Western Kho-Bwa”
    corecore